Picture for Xiaopeng Li

Xiaopeng Li

Efficient Exploration for Iterative Nash Preference Optimization

Add code
May 31, 2026
Viaarxiv icon

When Hard Negatives Hurt: Bridging the Generative-Discriminative Gap in Hard Negative Synthesis for Retrieval

Add code
May 31, 2026
Viaarxiv icon

Reinforced Preference Optimization for Reasoning-Augmented Recommendations

Add code
May 21, 2026
Viaarxiv icon

Personalized Deep Research: A User-Centric Framework, Dataset, and Hybrid Evaluation for Knowledge Discovery

Add code
May 11, 2026
Viaarxiv icon

From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space

Add code
Apr 28, 2026
Viaarxiv icon

GeoRouter: Dynamic Paradigm Routing for Worldwide Image Geolocalization

Add code
Mar 25, 2026
Viaarxiv icon

High-Slip-Ratio Control for Peak Tire-Road Friction Estimation Using Automated Vehicles

Add code
Mar 10, 2026
Viaarxiv icon

To Search or Not to Search: Aligning the Decision Boundary of Deep Search Agents via Causal Intervention

Add code
Feb 03, 2026
Viaarxiv icon

Reward-free Alignment for Conflicting Objectives

Add code
Feb 02, 2026
Viaarxiv icon

Enhancing Conversational Agents via Task-Oriented Adversarial Memory Adaptation

Add code
Jan 29, 2026
Viaarxiv icon